NOVEL HUMAN PROTOCADHERIN PROTEINS AND 
POLYNUCLEOTIDES ENCODING THE SAME 



The present application claims the benefit of U.S. 
Provisional Application Number 60/257,257 which was filed on 
5 December 20, 2000 and is herein incorporated by reference in its 
entirety. 

1 . INTRODUCTION 
The present invention relates to the discovery, 
identification, and characterization of novel human 

10 polynucleotides encoding proteins that share sequence similarity 
with mammalian cadherins . The invention encompasses the 
described polynucleotides, host cell expression systems, the 
encoded proteins, fusion proteins, polypeptides and peptides, 
antibodies to the encoded proteins and peptides, and genetically 

15 engineered animals that either lack or overexpress the disclosed 
genes, antagonists and agonists of the proteins, and other 
compounds that modulate the expression or activity of the 
proteins encoded by the disclosed genes, which can be used for 
diagnosis, drug screening, clinical trial monitoring, the 

2 0 treatment of diseases and disorders, and cosmetic or 
nutriceutical applications . 

2 . BACKGROUND OF THE INVENTION 
Cadherin proteins are membrane proteins that have been 
linked to a variety of biological processes varying from 

2 5 development, tumor suppression, neural function, and cell 

communication . 

3 . SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
identification, and characterization of nucleotides that encode 

3 0 novel human proteins, and the corresponding amino acid sequences 
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w ' of these proteins. The novel human proteins (NHPs) described 

for the first time herein share structural similarity with 
animal protocadherins, and especially the protocadherin FAT. 

The novel human nucleic acid sequences described herein, 
5 encode alternative proteins/open reading frames (ORFs) of 4589, 
3852, 4585, and 4588 amino acids in length (see respectively SEQ 
ID NOS: 2, 4, 6, and 8) . 

The invention also encompasses agonists and antagonists of 
the described NHPs, including small molecules, large molecules, 
10 mutant NHPs, or portions thereof, that compete with native NHP, 

p peptides, and antibodies, as well as nucleotide sequences that 

Q 

^ can be used to inhibit the expression of the described NHPs 

Ul (e.g., antisense and ribozyme molecules, and open reading frame 

ry or regulatory sequence replacement constructs) or to enhance 1 the 

- 1 15 expression of the described NHPs (e.g., expression constructs 

s 

H that place the described polynucleotide under the control of a 

n I 

Mr strong promoter system) , and transgenic animals that express a 

Jz NHP sequence, or "knock-outs" (which can be conditional) that do 

M not express a functional NHP. Knock-out mice can be produced in 

2 0 several ways, one of which involves the use of mouse embryonic 
stem cells ("ES cells") lines that contain gene trap mutations 
in a murine homolog of at least one of the described NHPs. When 
the unique NHP sequences described in SEQ ID NOS: 1-8 are 
"knocked-out " they provide a method of identifying phenotypic 
2 5 expression of the particular gene as well as a method of 

assigning function to previously unknown genes. In addition, 
animals in which the unique NHP sequences described in SEQ ID 
NOS: 1-8 are "knocked-out" provide a unique source in which to 
elicit antibodies to homologous and orthologous proteins which 
30 would have been previously viewed by the immune system as "self" 
and therefore would have failed to elicit significant antibody 
responses . 



2 



LEX-0285-USA 




Additionally, the unique NHP sequences described in SEQ ID 
N0S:l-8 are useful for the identification of protein coding 
sequence and mapping a unique gene to a particular chromosome 
(the gene encoding the described sequences is apparently encoded 
5 on human chromosome 11, see GENBANK accession number AC024231) . 
These sequences identify actual, biologically verified, and 
therefore relevant, exon splice junctions as opposed to those 
that may have been bioinf ormatically predicted from genomic 
sequence alone. The sequences of the present invention are also 
10 useful as additional DNA markers for restriction fragment length 
I polymorphism (RFLP) analysis, and in forensic biology. 

* Further, the present invention also relates to processes 

j 

1 for identifying compounds that modulate, i.e., act as agonists 

* or antagonists, of NHP expression and/or NHP activity that 

115 utilize purified preparations of the described NHPs and/or NHP 

jU product, or cells expressing the same. Such compounds can be 
used as therapeutic agents for the treatment of any of a wide 

variety of symptoms associated with biological disorders or 

D 

imbalances . 

20 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequences of the NHP ORFs 
encoding the described NHP amino acid sequences . 

2 5 5. DETAILED DESCRIPTION OF THE INVENTION 

The NHPs described for the first time herein are novel 
proteins that may be expressed in, inter alia, human cell lines, 
fetal brain, brain, pituitary, cerebellum, fetal kidney, fetal 
lung, and 6- and 9 -week embryos. 

3 0 The present invention encompasses the nucleotides presented 

in the Sequence Listing, host cells expressing such nucleotides, 
the expression products of such nucleotides, and: (a) 
nucleotides that encode mammalian homologs of the described 
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genes, including the specifically described NHPs, and the NHP 
products; (b) nucleotides that encode one or more portions of 
the NHPs that correspond to functional domains, and the 
polypeptide products specified by such nucleotide sequences, 
5 including but not .limited to the novel regions of any active 

domain(s); (c) isolated nucleotides that encode mutant versions, 
engineered or naturally occurring, of the described NHPs in 
which all or a part of at least one domain is deleted or 
altered, and the polypeptide products specified by such 
; : 10 nucleotide sequences, including but not limited to soluble 
C proteins and peptides in which all or a portion of the signal 

5 5 

O (or hydrophobic transmembrane) sequence is deleted; (d) 

IZ nucleotides that encode chimeric fusion proteins containing all 

fy or a portion of a coding region of an NHP, or one of its domains 

in 15 (e.g., a receptor or ligand binding domain, accessory 
J\ protein/self -association domain, etc. ) fused to another peptide 

or polypeptide; or (e) therapeutic or diagnostic derivatives of 
the described polynucleotides such as oligonucleotides, 
antisense polynucleotides, ribozymes, dsRNA, or gene therapy 
2 0 constructs comprising a sequence first disclosed in the Sequence 
Listing . 

As discussed above, the present invention includes: (a) the 
human DNA sequences presented in the Sequence Listing (and 
vectors comprising the same) and additionally contemplates any 
2 5 nucleotide sequence encoding a contiguous NHP open reading frame 
(ORF) that hybridizes to a complement of a DNA sequence 
presented in the Sequence Listing under highly stringent 
conditions, e.g., hybridization to filter-bound DNA in 0.5 M 
NaHP0 4 , 7% sodium dodecyl sulfate (SDS) , 1 mM EDTA at 65°C, and 
30 washing in 0.1xSSC/0.1% SDS at 68°C (Ausubel F.M. et al . , eds . , 
1989, Current Protocols in Molecular Biology, Vol. I, Green 
Publishing Associates, Inc., and John Wiley & Sons, Inc., NY, at 
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p. 2.10.3) and encodes a functionally equivalent expression 
product . 

.Additionally contemplated are any nucleotide sequences that 
hybridize to the complement of a DNA sequence that encodes and 
expresses an amino acid sequence presented in the Sequence 
Listing under moderately stringent conditions, e.g., washing in 
0.2xSSC/0.1% SDS at 42°C (Ausubel et al . , 1989, supra), yet 
still encodes a functionally equivalent NHP product. Functional 
equivalents of a NHP include naturally occurring NHPs present in 
other species and mutant NHPs whether naturally occurring or 
engineered (by site directed mutagenesis, gene shuffling, • 
directed evolution as described in, for example, U.S. Patent 
Nos. 5,837,458 and 5,723,323 both of which are herein 
incorporated by reference in their entirety) . The invention 
also includes degenerate nucleic acid variants of the disclosed 
NHP polynucleotide sequences. 

Additionally contemplated are polynucleotides encoding NHP 
ORFs, or their functional equivalents, encoded by polynucleotide 
sequences that are about 99, 95, 90, or about 85 percent 
similar or identical to corresponding regions of the nucleotide 
sequences of the Sequence Listing (as measured by BLAST sequence 
comparison analysis using, for example, the GCG sequence 
analysis package using standard default settings) . 

The invention also includes nucleic acid molecules, 
preferably DNA molecules, that hybridize to, and are therefore 
the complements of, the described NHP gene nucleotide sequences. 
Such hybridization conditions may be highly stringent or less 
highly stringent, as described above. In instances where the 
nucleic acid molecules are deoxyoligonucleotides ("DNA oligos" ), 
such molecules are generally about 16 to about 100 bases long, 
or about 2 0 to about 80, or about 34 to about 45 bases long, or 
any variation or combination of sizes represented therein that 
incorporate a contiguous region of sequence first disclosed in 
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the Sequence Listing. Such oligonucleotides can be used in 
conjunction with the polymerase chain reaction (PCR) to screen 
libraries, isolate clones, and prepare cloning and sequencing 
templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing gene 
expression patterns (particularly using a micro array or high- 
throughput "chip" format) . Additionally, a series of the 
described NHP oligonucleotide sequences, or the complements 
thereof, can be used to represent all or a portion of the 
described NHP sequences. An oligonucleotide or polynucleotide 
sequence first disclosed in at least a portion of one or more of 
the sequences of SEQ ID NOS : 1-8 can be used as a hybridization 
probe in conjunction with a solid support matrix/substrate 
(resins, beads, membranes, plastics, polymers, metal or 
metallized substrates, crystalline or polycrystalline 
substrates, etc.). Of particular note are spatially addressable 
arrays (i.e., gene chips, microti ter plates, etc.) of 
oligonucleotides and polynucleotides, or corresponding 
oligopeptides and polypeptides, wherein at least one of the 
biopolymers present on the spatially addressable array comprises 
an oligonucleotide or polynucleotide sequence first disclosed in 
at least one of the sequences of SEQ ID NOS: 1-8, or an amino 
acid sequence encoded thereby. Methods for attaching 
biopolymers to, or synthesizing biopolymers on, solid support 
matrices, and conducting binding studies thereon are disclosed 
in, inter alia, U.S. Patent Nos . 5,700,637, 5,556,752, 
5, 744, 3 05, 4, 631, 211, 5, 445, 934, 5, 252, 743, 4, 713, 32 6, 
5,424,186, and 4,689,405 the disclosures of which are herein 
incorporated by reference in their entirety. 

Addressable arrays comprising sequences first disclosed in 
SEQ ID NOS: 1-8 can be used to identify and characterize the 
temporal and tissue specific expression of a gene. These 
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' addressable arrays incorporate oligonucleotide sequences of 

sufficient length to confer the required specificity, yet be 
within the limitations of the production technology. The length 
of these probes is within a range of between about 8 to about 
5 2000 nucleotides. Preferably the probes consist of 60 
nucleotides and more preferably 2 5 nucleotides from the - 
sequences first disclosed in SEQ ID N0S:l-8. 

For example, a series of the described oligonucleotide 
sequences, or the complements thereof, can be used in chip 
*V10 format to represent all or a portion of the described sequences, 
fri The oligonucleotides, typically between about 16 to about 40 (or 
any whole number within the stated range) nucleotides in length 

rti 

m can partially overlap each other and/or the sequence may be 

represented using oligonucleotides that do not overlap. 
U115 Accordingly, the described polynucleotide sequences shall 

s 

L= typically comprise at least about two or three distinct 

!** oligonucleotide sequences of at least about 8 nucleotides in 

s — Tl, 
3 — 

J3 length that are each first disclosed in the described Sequence 

£T Listing. Such oligonucleotide sequences can begin at any 

20 nucleotide present within a sequence in the Sequence Listing and 
proceed in either a sense (S'-to-S 1 ) orientation vis-a-vis the 
described sequence or in an antisense orientation. 

Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of 
2 5 gene functions and generating novel and unexpected insight into 
transcriptional processes and biological mechanisms. The use of 
addressable arrays comprising sequences first disclosed in SEQ 
ID NOS:l-8 provides detailed information about transcriptional 
changes involved in a specific pathway, potentially leading to 
30 the identification of novel components or gene functions that 
manifest themselves as novel phenotypes . 

Probes consisting of sequences first disclosed in SEQ ID 
N0S:l-8 can also be used in the identification, selection and 
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validation of novel molecular targets for drug discovery. The 
use of these unique sequences permits the direct confirmation 
of drug targets and recognition of drug dependent changes in 
gene expression that are modulated through pathways distinct 
5 from the drugs intended target . These unique sequences 

therefore also have utility in defining and monitoring both drug 
action and toxicity. 

As an example of utility, the sequences first disclosed in 
SEQ ID N0S:l-8 can be utilized in microarrays or other assay 

• : 10 formats, to screen collections of genetic material from patients 

1== 

y who have a particular medical condition: These investigations 

fU can also be carried out using the sequences first disclosed in 

ml SEQ ID NOS:l-8 in silico and by comparing previously collected 

genetic databases and the disclosed sequences using computer 

5 15 software known to those in the art. 

5 i a 

Thus the sequences first disclosed in SEQ ID NOS:l-8 can be 
^ used to identify mutations associated with a particular disease 
□ and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 
20 specifically described using nucleotide sequence, it should be 

appreciated that each of the sequences can uniquely be described 
using any of a wide variety of additional structural attributes, 
or combinations thereof. For example, a given sequence can be 
described by the net composition of the nucleotides present 
25 within a given region of the sequence in conjunction with the 
presence of one or more specific oligonucleotide sequence (s) 
first disclosed in the SEQ ID NOS : 1-8. Alternatively, a 
restriction map specifying the relative positions of restriction 
endonuclease digestion sites, or various palindromic or other 
30 specific oligonucleotide sequences can be used to structurally 
describe a given sequence. Such restriction maps, which are 
typically generated by widely available computer programs (e.g., 
the University of Wisconsin GCG sequence analysis package, 
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SEQUENCHER 3 . 0 , Gene Codes Corp., Ann Arbor, MI, etc.), can 
optionally be used in conjunction with one or more discrete 
nucleotide sequence (s) present in the sequence that can be 
described by the relative position of the sequence relative to 
one or more additional sequence (s) or one or more restriction 
sites present in the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions may 
refer, e.g., to washing in 6x SSC/0.05% sodium pyrophosphate at 
37°C (for 14-base oligos), 48°C (for 17-base oligos), 55°C (for 
20-base oligos) , and 60°C (for 23-base oligos) . These nucleic 
acid molecules may encode or act as NHP gene antisense 
molecules, useful, for example, in NHP gene regulation and/or as 
antisense primers in amplification reactions of NHP gene nucleic 
acid sequences. With respect to NHP gene regulation, such 
techniques can be used to regulate biological functions. 
Further, such sequences may be used as part of ribozyme and/or 
triple helix sequences that are also useful for NHP gene 
regulation . 

Inhibitory antisense or double stranded oligonucleotides 
can additionally comprise at least one modified base moiety 
which is selected from the group including but not limited to 
5-f luorouracil , 5-bromouracil , 5-chlorouracil , 5-iodouracil , 
hypoxan thine , xanthine , 4-acetylcy tosine , 

5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2- thiouridine , 5-carboxymethylaminomethyluracil , dihydrouracil , 
beta-D-galactosylqueosine, inosine , N6-isopentenyladenine , 

1 - methylguanine, 1 -methyl inos ine , 2 , 2-dimethylguanine, 

2 - methy ladenine , 2 -methylguanine , 3 -me thylcy tosine , 
5 -methyl cy tosine, N6- adenine, 7 -methylguanine, 

5-methylaminomethyluracil , 5-methoxyaminomethyl-2- thiouracil , 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil , 
5-methoxyuracil , 2 -methyl thio-N6-isopenteny ladenine, uracil- 5- 
oxyacetic acid (v) , wybutoxosine, pseudouracil , queosine, 
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2- thiocytosine, 5-methyl-2- thiouracil , 2-thiouracil , 
4- thiouracil , 5-methyluracil , uracil-5-oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl-2- thiouracil , 

3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2 , 6-diaminopurine . 

The antisense oligonucleotide can also comprise at least 
one modified sugar moiety selected from the group including but 
not limited to arabinose, 2-f luoroarabinose, xylulose, and 
hexose . 

In yet another embodiment, the antisense oligonucleotide 
will comprise at least one modified phosphate backbone selected 
from the group including, but not limited to, a 

phosphorothioate, a phosphorodithioate , a phosphoramidothioate , 
a phosphoramidate, a phosphordi ami date, a methylphosphonate, an 
alkyl phosphotriester , and a formacetal or analog thereof. 

In yet another embodiment,, the antisense oligonucleotide is 
an a-anomeric oligonucleotide. An a-anomeric oligonucleotide 
forms specific double- stranded hybrids with complementary RNA in 
which, contrary to the usual (3-units, the strands run parallel 
to each other (Gautier et al . , 1987, Nucl. Acids Res. 25:6625- 
6641) . The oligonucleotide is a 2'-0-methylribonucleotide (Inoue 
et al., 1987, Nucl. Acids Res. 25:6131-6148), or a chimeric RNA- 
DNA analogue (Inoue et al . , 1987, FEBS Lett. 225:327-330). 
Alternatively, double stranded RNA can be used to disrupt the 
expression and function of a targeted NHP. 

Oligonucleotides of the invention can be synthesized by 
standard methods known in the art, e.g. by use of an automated 
DNA synthesizer (such as are commercially available from 
Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides can be synthesized by the 
method of Stein et al . (1988, Nucl. Acids Res. 25:3209), and 
methylphosphonate oligonucleotides can be prepared by use of 
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controlled pore glass polymer supports (Sarin et al . , 1988, 
Proc. Natl. Acad. Sci . U.S.A. 85:7448-7451), etc. 

Low stringency conditions are well-known to those of skill 
in the art, and will vary predictably depending on the specific 
organisms from which the library and the labeled sequences, are 
derived. For guidance regarding such conditions see, for 
example, Sambrook et al . , 1989, Molecular Cloning, A Laboratory 
Manual (and periodic updates thereof) , Cold Spring Harbor Press, 
NY; and Ausubel et al . , 1989, Current Protocols in Molecular 
10 Biology, Green Publishing Associates and Wiley Interscience, NY. 
i' Alternatively, suitably labeled NHP nucleotide probes can 

be used to screen a human genomic library using appropriately 
stringent conditions or by PCR. The identification and 
characterization of human genomic clones is helpful for 
$5 identifying polymorphisms (including, but not limited to, 

nucleotide repeats, microsatellite alleles, single nucleotide 
polymorphisms, or coding single nucleotide polymorphisms) , 
determining the genomic structure of a given locus /allele, and 
Q designing diagnostic tests. For example, sequences derived from 
^0 regions adjacent to the intron/exon boundaries of the human gene 
can be used to design primers for use in amplification assays to 
detect mutations within the exons , introns, splice sites (e.g., 
splice acceptor and/or donor sites), etc., that can be used in 
diagnostics and pharmacogenomics . 
2 5 For example, the present sequences can be used in 

restriction fragment length polymorphism (RFLP) analysis to 
identify specific individuals. In this technique, an 
individual's genomic DNA is digested with one or more 
restriction enzymes, and probed on a Southern blot to yield 
30 unique bands for identification (as generally described in U.S. 
Patent No. 5,272,057, incorporated herein by reference). In 
addition, the sequences of the present invention can be used to 
provide polynucleotide reagents, e.g., PCR primers, targeted to 
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specific loci in the human genome, which can enhance the 
reliability of DNA-based forensic identifications by, for 
example, providing another "identification marker" (i.e., 
another DNA sequence that is unique to a particular individual) . 
5 Actual base sequence information can be used for identification 
as an accurate alternative to patterns formed by restriction 
enzyme generated fragments . 

Further, a NHP gene homolog can be isolated from nucleic 
acid from an organism of interest by performing PCR using two 
10 degenerate or "wobble" oligonucleotide primer pools designed on 
the basis of amino acid sequences within the NHP products 
disclosed herein. The template for the reaction may be total 
RNA, mRNA, and/or cDNA obtained by reverse transcription of mRNA 
prepared from human or non-human cell lines or tissue known or 
*J 15 suspected to express an allele of a NHP gene. The PCR product 
can be subcloned and sequenced to ensure that the amplified 
sequences represent the sequence of the desired NHP gene. The 
PCR fragment can then be used to isolate a full length cDNA 
clone by a variety of methods. For example, the amplified 
ff 2 0 fragment can be labeled and used to screen a cDNA library, such 
as a bacteriophage cDNA library. Alternatively, the labeled 
fragment can be used to isolate genomic clones via the screening 
of a genomic library. 

PCR technology can also be used to isolate full length cDNA 
25 sequences. For example, RNA can be isolated, following standard 
procedures, from an appropriate cellular or tissue source (i.e., 
one known, or suspected, to express a NHP gene) . A reverse 
transcription (RT) reaction can be performed on the RNA using an 
oligonucleotide primer specific for the most 5' end of the 
30 amplified fragment for the priming of first strand synthesis. 
The resulting RNA /DNA hybrid may then be "tailed" using a 
standard terminal transferase reaction, the hybrid may be 
digested with RNase H, and second strand synthesis may then be 
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primed with a complementary primer. Thus, cDNA sequences 
upstream of the amplified fragment can be isolated. For a 
review of cloning strategies that can be used, see e.g., 
Sambrook et al . , 1989, supra. 
5 A cDNA encoding a mutant NHP sequence can be isolated, for 

example, by using PCR. In this case, the first cDNA strand may 
be synthesized by hybridizing an oligo-dT oligonucleotide to 
mRNA isolated from tissue known or suspected to be expressed in 
an individual putatively carrying a mutant NHP allele, and by 
10 extending the new strand with reverse transcriptase. The second 
strand of the cDNA is then synthesized using an oligonucleotide 
Q that hybridizes specifically to the 5' end of the normal 
sequence. Using these two primers, the product is then 
amplified via PCR, optionally cloned into a suitable vector, and 
pyl5 subjected to DNA sequence analysis through methods well-known to 
fcs those of skill in the art. By comparing the DNA sequence of the 

s 

H= mutant NHP allele to that of a corresponding normal NHP allele, 

fy 

LI the mutation (s) responsible for the loss or alteration of 
~f function of the mutant NHP gene product can be ascertained. 

e a 

M-20 Alternatively, a genomic library can be constructed using 

DNA obtained from an individual suspected of or known to carry a 
mutant NHP allele {e.g., a person manifesting a NHP-associated 
phenotype such as, for example, obesity, high blood pressure, 
connective tissue disorders, infertility, etc.), or a cDNA 

25 library can be constructed using RNA from a tissue known, or 
suspected, to express a mutant NHP allele. A normal NHP gene, 
or any suitable fragment thereof, can then be labeled and used 
as a probe to identify the corresponding mutant NHP allele in 
such libraries. Clones containing mutant NHP sequences can then 

30 be purified and subjected to sequence analysis according to 
methods well-known to those skilled in the art. 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated from 
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a tissue known, or suspected, to express a mutant NHP allele in 
an individual suspected of or known to carry such a mutant 
allele. In this manner, gene products made by the putatively 
mutant tissue can be expressed and screened using standard 
antibody screening techniques in conjunction with antibodies 
raised against a normal NHP product, as described below. For 
screening techniques, see, for example, Harlow, E. and Lane, 
eds., 1988, "Antibodies: A Laboratory Manual", Cold Spring 
Harbor Press, Cold Spring Harbor, NY. 

Additionally, screening can be accomplished by screening 
with labeled NHP fusion proteins, such as, for example, alkaline 
phosphatase-NHP or NHP-alkaline phosphatase fusion proteins. In 
cases where a NHP mutation results in an expression product with 
altered function (e.g., as a result of a missense or a 
frameshift mutation) , polyclonal antibodies to NHP are likely to 
cross-react with a corresponding mutant NHP expression product. 
Library clones detected via their reaction with such labeled 
antibodies can be purified and subjected to sequence analysis 
according to methods well-known in the art. 

The invention also encompasses (a) DNA vectors that contain 
any of the foregoing NHP coding sequences and/or their 
complements (i.e., antisense) ; (b) DNA expression vectors that 
contain any of the foregoing NHP coding sequences operatively 
associated with a regulatory element that directs the expression 
of the coding sequences (for example, baculovirus as described 
in U.S. Patent No. 5,869,336 herein incorporated by reference) ; 
(c) genetically engineered host cells that contain any of the 
foregoing NHP coding sequences operatively associated with a 
regulatory element that directs the expression of the coding 
sequences in the host cell; and (d) genetically engineered host 
cells that express an endogenous NHP sequence under the control 
of an exogenously introduced regulatory element (i.e., gene 
activation) . As used herein, regulatory elements include, but 
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are not limited to, inducible and non-inducible promoters, 
enhancers, operators and other elements known to those skilled 
in the art that drive and regulate expression. Such regulatory 
elements include but are not limited to the cytomegalovirus 
(hCMV) immediate early gene, regulatable, viral elements 
(particularly retroviral LTR promoters) , the early or late 
promoters of SV40 adenovirus, the lac system, the trp system, 
the TAC system, the TRC system, the major operator and promoter 
regions of phage lambda, the control regions of fd coat protein, 
the promoter for 3-phosphoglycerate kinase (PGK) , the promoters 
of acid phosphatase, and the promoters of the yeast a-mating 
factors. 

The present invention also encompasses antibodies and anti- 
idiotypic antibodies (including Fab fragments) , antagonists and 
agonists of a NHP, as well as compounds or nucleotide constructs 
that inhibit expression of a NHP sequence (transcription factor 
inhibitors, antisense and ribozyme molecules, or open reading 
frame sequence or regulatory sequence replacement constructs) , 
or promote the expression of a NHP (e.g., expression constructs 
in which NHP coding sequences are operatively associated with 
expression control elements such as promoters, 
promoter /enhancers , etc.) . 

The NHPs or NHP peptides, NHP fusion proteins, NHP 
nucleotide sequences, antibodies, antagonists and agonists can 
be useful for the detection of mutant NHPs or inappropriately 
expressed NHPs for the diagnosis of disease. The NHP proteins 
or peptides, NHP fusion proteins, NHP nucleotide sequences, host 
cell expression systems, antibodies, antagonists, agonists and 
genetically engineered cells and animals can be used for 
screening for drugs (or high throughput screening of 
combinatorial libraries) effective in the treatment of the 
symptomatic or phenotypic manifestations of perturbing the 
normal function of NHP in the body. The use of engineered host 
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cells and/or animals may offer an advantage in that such systems 
allow not only for the identification of compounds that bind to 
the endogenous receptor for an NHP, but can also identify 
compounds that trigger NHP-mediated activities or pathways. 

Finally, the NHP products can be used as therapeutics. For 
example, soluble derivatives such as NHP peptides /domains 
corresponding to NHPs, NHP fusion protein products (especially 
NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of a 
NHP, to an IgFc) , NHP antibodies and anti-idiotypic antibodies 
(including Fab fragments) , antagonists or agonists (including 
compounds that modulate or act on downstream targets in a NHP- 
mediated pathway) can be used to directly treat diseases or 
disorders. For instance, the administration of an effective 
amount of soluble NHP, or a NHP-IgFc fusion protein or an anti- 
idiotypic antibody (or its Fab) that mimics the NHP could 
activate or effectively antagonize the endogenous NHP receptor. 
Nucleotide constructs encoding such NHP products can be used to 
genetically engineer host cells to express such products in 
vivo; these genetically engineered cells function as 
"bioreactors" in the body delivering a continuous supply of a 
NHP, a NHP peptide, or a NHP fusion protein to the body. 
Nucleotide constructs encoding functional NHPs, mutant NHPs, as 
well as antisense and ribozyme molecules can also be used in 
"gene therapy" approaches for the modulation of NHP expression. 
Thus, the invention also encompasses pharmaceutical formulations 
and methods for treating biological disorders. 

Various aspects of the invention are described in greater 
detail in the subsections below. 
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5 . 1 THE NHP SEQUENCES 
The cDNA sequences and the corresponding deduced amino acid 
sequences of the described NHPs are presented in the Sequence 
Listing. The NHP nucleotides were obtained from clustered human 
5 ESTs, and cDNAs made from brain mRNA (Edge Biosystems, 
Gaithersburg, MD) . 

Several polymorphisms were identified including an A/T 
polymorphism at the nucleotide position represented by, for 
example, position 4543 of SEQ ID NO : 1 (which can result in a thr 
10 or ser at the region corresponding to amino acid (aa) position 

Hb 1515 of, for example, SEQ ID NO : 2 ) , an A/G polymorphism at 

o 

m nucleotide position 4775 (which can result in an asp or gly at 

FU aa position 1592), an A/G polymorphism at the nucleotide 

in 

fy position represented by, for example, position 6878 of SEQ ID 

n\ 

[515 N0:1 (which can result in an asn or ser at the region 

y I 

s corresponding to amino acid (aa) position 2293 of, for example, 

pj SEQ ID NO:2), a G/C polymorphism at nucleotide position 7227 

(which can result in an arg or pro at aa position 2409) , a G/A 
Q polymorphism at the nucleotide position represented by, for 
^20 example, position 8263 of SEQ ID NO : 1 (which can result in a val 
or ile at the region corresponding to amino acid (aa) position 
2755 of, for example, SEQ ID NO : 2 ) , a G/A polymorphism at 
nucleotide position 10552 (which can result in val or leu at aa 
position 3518 of, for example, SEQ ID NO : 2 ) , a G/A polymorphism 
25 at nucleotide position 11434 (which can result in a gly or ser 
at aa position 3812), a C/A polymorphism at the nucleotide 
position represented by, for example, position 12691 of SEQ ID 
NO:l (which can result in a pro or thr at the region 
corresponding to amino acid (aa) position 4231 of, for example, 
30 SEQ ID NO:2), a G/A polymorphism at nucleotide position 12770 
(which can result in a gly or glu at aa position 4257 of, for 
example, SEQ ID NO:2), and a C/G polymorphism at the nucleotide 
position represented by, for example, position 12820 of SEQ ID 
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N0:1 (which can result in a leu or val at the region 
corresponding to amino acid (aa) position 4274 of, for example, 
SEQ ID NO: 2) . 

The disclosed NHPs are apparently encoded on human 
5 chromosome 11 (or possibly human chromosome 8) . 

The described novel human polynucleotide sequences can be 
used, among other things, in the molecular mutagenesis/evolution 
of proteins that are at least partially encoded by the described 
novel sequences using, for example, polynucleotide shuffling or 
10 related methodologies. Such approaches are described in U.S. 
Patent Nos . 5,830,721 and 5,837,458 which are herein 
O incorporated by reference in their entirety. 

p NHP gene products can also be expressed in transgenic 

Jj] animals. Animals of any species ,' including, but not limited to, 

fyl5 worms, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 

birds, goats, and non-human primates, e.g., baboons, monkeys, 

£ 

^* and chimpanzees may be used to generate NHP transgenic animals. 
!=£ Any technique known in the art may be used to introduce a 

2? NHP transgene into animals to produce the founder lines of 
H20 transgenic animals. Such techniques include, but are not 

limited to pronuclear microinjection (Hoppe, P.C. and Wagner, 
T.E., 1989, U.S. Patent No. 4,873,191); retrovirus-mediated gene 
transfer into germ lines (Van der Putten et al . , 1985, Proc . 
Natl. Acad. Sci . , USA 82:6148-6152); gene targeting in embryonic 
25 stem cells (Thompson et al . , 1989, Cell 56:313-321); 

electroporation of embryos (Lo, 1983, Mol Cell. Biol. 3:1803- 
1814); and sperm-mediated gene transfer (Lavitrano et al . , 1989, 
Cell 57:717-723) ; etc. For a review of such techniques, see 
Gordon, 1989, Transgenic Animals, Intl. Rev. Cytol . 115:171-229, 
3 0 which is incorporated by reference herein in its entirety. 

The present invention provides for transgenic animals that 
carry the NHP transgene in all their cells, as well as animals 
which carry the transgene in some, but not all their cells, 
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i.e., mosaic animals or somatic cell transgenic animals. The 
transgene may be integrated as a single transgene or in 
concatamers, e.g., head-to-head tandems or head-to-tail tandems. 
The . transgene may also be selectively introduced into and 
5 activated in a particular cell-type by following, for example, 
the teaching of Lasko et al . , 1992, Proc . Natl. Acad. Sci . USA 
85:6232-6236. The regulatory sequences required for such a 
cell-type specific activation will depend upon the particular 
cell- type of interest, and will be apparent to those of skill in 
10 the art. 

M When it is desired that a NHP transgene be integrated into 

o 

□ the chromosomal site of the endogenous NHP gene, gene targeting 

=5 5 

i:~ is preferred. Briefly, when such a technique is to be utilized, 
rtJ vectors containing some nucleotide sequences homologous to the 
lf|15 endogenous NHP gene are designed for the purpose of integrating, 
via homologous recombination with chromosomal sequences, into 

La. 

and disrupting the function of the nucleotide sequence of the 
endogenous NHP gene (i.e., "knockout" animals). 

The transgene can also be selectively introduced into a 
20 particular cell- type, thus inactivating the endogenous NHP gene 
in only that cell-type, by following, for example, the teaching 
of Gu et al . , 1994, Science, 265:103-106. The regulatory 
sequences required for such a cell-type specific inactivation 
will depend upon the particular cell-type of interest, and will 
25 be apparent to those of skill in the art. 

Once transgenic animals have been generated, the expression 
of the recombinant NHP gene may be assayed utilizing standard 
techniques. Initial screening may be accomplished by Southern 
blot analysis or PCR techniques to analyze animal tissues to 
3 0 assay whether integration of the transgene has taken place. The 
level of mRNA expression of the transgene in the tissues of the 
transgenic animals may also be assessed using techniques which 
include but are not limited to Northern blot analysis of tissue 
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samples obtained from the animal, in situ hybridization 
analysis, and RT-PCR. Samples of NHP gene-expressing tissue, 
may also be evaluated immunocytochemically using antibodies 
specific for the NHP transgene product. 
5 The present invention provides for "knockin" animals. 

Knockin animals are those in which a gene that the animal does 
not naturally have in its genome, is inserted. For example, 
when a human gene is used to replace its murine ortholog in the 
mouse. Such knockin animals are useful for the in vivo study, 
10 testing and validation of, intra alia, human drug targets as 

ju*. well as for compounds that are directed at the same. 

O 

ft! 5.2 NHPS AND NHP POLYPEPTIDES 

S = NHPs, polypeptides, peptide fragments, mutated, truncated, 

Fills or deleted forms of the NHPs, and/or NHP fusion proteins can be 

s US 

EJ = 

s prepared for a variety of uses. These uses include but are not 

JTJ limited to the generation of antibodies, as reagents in 
H diagnostic assays, the identification of other cellular gene 
f=l products related to a NHP, as reagents in assays for screening 
^20 for compounds that can be used as pharmaceutical reagents useful 
in the therapeutic treatment of mental, biological, or medical 
disorders and diseases. Given the similarity information and 
expression data, the described NHPs can be targeted (by drugs, 
oligos, antibodies, etc, ) in order to treat disease, or to 
25 therapeutically augment the efficacy of, for example, 

chemotherapeutic agents used in the treatment of cancer. 

The Sequence Listing discloses the amino acid sequences 
encoded by the described NHP genes. The NHPs typically display 
initiator methionines in DNA sequence contexts consistent with a 
30 translation initiation site, and a signal like sequences near 
their N- terminal ends as typical of many other membrane 
proteins . 
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The NHP amino acid sequences of the invention include the 
amino acid sequence presented in the Sequence Listing as well as 
analogues and derivatives thereof. Further, corresponding NHP 
homologues from other species are encompassed by the invention. 
5 In fact, any NHP protein encoded by the NHP nucleotide sequences 
described above are within the scope of the invention, as are 
any novel polynucleotide sequences encoding all or any novel 
portion of an amino acid sequence presented in the Sequence 
Listing. The degenerate nature of the genetic code is well- 
10 known, and, accordingly, each amino acid presented in the 

Sequence Listing, is generically representative of the well- 
known nucleic acid ''triplet" codon, or in many cases codons, 
that can encode the amino acid. As such, as contemplated 
herein, the amino acid sequences presented in the Sequence 



pjl5 Listing, when taken together with the genetic code (see, for 
y * example, Table 4-1 at page 109 of "Molecular Cell Biology", 
M* 1986, J. Darnell et al . eds . , Scientific American Books, New 

ru 

LI York, NY, herein incorporated by reference) are generically 
■« representative of all the various permutations and combinations 
y,20 of nucleic acid sequences that can encode such amino acid 
sequences . 

The invention also encompasses proteins that are 
functionally equivalent to the NHPs encoded by the presently 
described nucleotide sequences as judged by any of a number of 

25 criteria, including', but not limited to, the ability to bind and 
cleave a substrate of a NHP, or the ability to effect an 
identical or complementary downstream pathway, or a change in 
cellular metabolism (e.gr., proteolytic activity, ion flux, 
tyrosine phosphorylation, etc.). Amino acid substitutions may 

30 be made on the basis of similarity in polarity, charge, 
s o lubi 1 i ty , hydr ophobi c i ty , hydr ophi 1 i c i ty , and / or the 
amphipathic nature of the residues involved. For example, 
nonpolar (hydrophobic) amino acids include alanine, leucine, 
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isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, serine, 
threonine, cysteine, tyrosine, asparagine, and glutamine; 
positively charged (basic) amino acids include arginine, lysine, 
5 and histidine; and negatively charged (acidic) amino acids 
include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used to 
express the NHP nucleotide sequences of the invention. Where, 
as in the present instance, the NHP peptide or polypeptide is 
10 thought to be membrane protein, the hydrophobic regions of the 
protein can be excised and the resulting soluble peptide or 

y 

D polypeptide can be recovered from the culture media. Such 

ry 

\p expression systems also encompass engineered host cells that 

[~ express a NHP, or functional equivalent, in situ. Purification 

U1 15 or enrichment of a NHP from such expression systems can be 

accomplished using appropriate detergents and lipid micelles and 
l y methods well-known to those skilled in the art. However, such 

go engineered host cells themselves may be used in situations where 

O 

r; it is important not only to retain the structural and functional 

20 characteristics of the NHP, but to assess biological activity, 
e.g., in certain drug screening assays. 

The expression systems that may be used for purposes of the 
invention include, but are not limited to, microorganisms such 
as bacteria {e.g., E. coli, B. subtilis) transformed with 
2 5 recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 

expression vectors containing NHP nucleotide sequences; yeast 
(e.g., Saccharomyces , Pichia) transformed with recombinant yeast 
expression vectors containing NHP nucleotide sequences; insect 
cell systems infected with recombinant virus expression vectors 
30 (e.g., baculovirus) containing NHP nucleotide sequences; plant 
cell systems infected with recombinant virus expression vectors 
(e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, 
TMV) or transformed with recombinant plasmid expression vectors 
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(e.g., Ti plasmid) containing NHP nucleotide sequences; or 
mammalian cell systems {e.g., COS, CHO, BHK, 2 93, 3T3) harboring 
recombinant expression constructs containing NHP nucleotide 
sequences and promoters derived from the genome of mammalian 
5 cells (e.g., metallothionein promoter) or from mammalian viruses 

(e.g., the adenovirus late promoter; the vaccinia virus 7 . 5K 
promoter) . 

In bacterial systems, a number of expression vectors may be 
advantageously selected depending upon the use intended for the 
10 NHP product being expressed. For example, when a large quantity 
of such a protein is to be produced for the generation of 
pharmaceutical compositions of or containing NHP, or for raising 
antibodies to a NHP, vectors that direct the expression of high 



y] levels of fusion protein products that are readily purified may 



, hi 



ry 15 be desirable. Such vectors include, but are not limited, to the 



If! 



E . coli expression vector pUR278 (Ruther et al . , 1983, EMBO J. 
2:1791), in which a NHP coding sequence may be ligated 



ru 

M, individually into the vector in frame with the la.cZ coding 
X region so that a fusion protein is produced; pIN vectors (Inouye 
H20 & Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke & 
Schuster, 1989, J. Biol. Chem. 254:5503-5509); and the like. 
pGEX vectors (Pharmacia or American Type Culture Collection) can 
also be used to express foreign polypeptides as fusion proteins 
with glutathione S- transferase (GST) . In general, such fusion 
25 proteins are soluble and can easily be purified from lysed cells 
by adsorption to glutathione-agarose beads followed by elution 
in the presence of free glutathione. The PGEX vectors are 
designed to include thrombin or factor Xa protease cleavage^ 
sites so that the cloned target expression product can be 
3 0 released from the GST moiety. 

In an insect system, Autographa calif ornica nuclear 
polyhedrosis virus (AcNPV) is used as a vector to express 
foreign polynucleotide sequences. The virus grows in Spodoptera 
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frugiperda cells. A NHP coding sequence can be cloned 
individually into non-essential regions (for example the 
polyhedrin gene) of the virus and placed under control of an 
AcNPV promoter (for example the polyhedrin promoter) . 
Successful insertion of NHP coding sequence will result in 
inactivation of the polyhedrin gene and production of non- 
occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 
recombinant viruses are then used to infect Spodoptera 
frugiperda cells in which the inserted sequence is expressed 
(e.g., see Smith efc al . , 1983, J. Virol. 46: 584; Smith, U.S. 
Patent No. 4,215,051). 

In mammalian host cells, a number of viral-based expression 
systems may be utilized. In cases where an adenovirus is used 
as an expression vector, the NHP nucleotide sequence of interest 
may be ligated to an adenovirus transcription/ translation 
control complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric sequence may then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Inser- 
tion in a non-essential region of the viral genome (e.g., region 
El or E3 ) will result in a recombinant virus that is viable and 
capable of expressing a NHP product in infected hosts (e.g., 
See Logan & Shenk, 1984, Proc . Natl. Acad. Sci . USA 81:3655- 
3659). Specific initiation signals may also be required for 
efficient translation of inserted NHP nucleotide sequences. 
These signals include the ATG initiation codon and adjacent 
sequences. In cases where an entire NHP gene or cDNA, including 
its own initiation codon and adjacent sequences, is inserted 
into the appropriate expression vector, no additional 
translational control signals may be needed. However, in cases 
where only a portion of a NHP coding sequence is inserted, 
exogenous translational control signals, including, perhaps, the 
ATG initiation codon, must be provided. Furthermore, the 
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initiation codon must be in phase with the reading frame of the 
desired coding sequence to ensure translation of the entire 
insert. These exogenous translational control signals and 
initiation codons can be of a variety of origins, both natural 
5 and synthetic. The efficiency of expression may be enhanced by 
the inclusion of appropriate transcription enhancer elements, 
transcription terminators, etc. (See Bitter et al . , 1987, 
Methods in Enzymol . 153:516-544). 

In addition, a host cell strain may be chosen that 
f'10 modulates the expression of the inserted sequences, or modifies 
=__, and processes the expression product in the specific fashion 

r— 

desired. Such modifications (e.g., glycosylation) and 
fl| processing {e.g., cleavage) of protein products may be important 

in 

rii for the function of the protein. Different host cells have 

m 15 characteristic and specific mechanisms for the post- transla- 

yi 

e tional processing and modification of proteins and expression 

La ^ 

hr. products. Appropriate cell lines or host systems can be chosen 

s y 

rf to ensure the correct modification and processing of the foreign 

yrj 

protein expressed. To this end, eukaryotic host cells which 
2 0 possess the cellular machinery for proper processing of the 

primary transcript, glycosylation, and phosphorylation of the 
expression product may be used. Such mammalian host cells 
include, but are not limited to, CHO, VERO, BHK, HeLa, COS, 
MDCK, 293, 3T3, WI38, and in particular, human cell lines. 

2 5 For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 
lines which stably express the NHP sequences described above can 
be engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells can be 

3 0 transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, 
transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of the foreign 
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DNA, engineered cells may be allowed to grow for 1-2 days in an 
enriched media, and then are switched to a selective media. The 
selectable marker in the recombinant plasmid confers resistance 
to the selection and allows cells to stably integrate the 
plasmid into their chromosomes and grow to form foci which in 
turn can be cloned and expanded into cell lines. This method 
may advantageously be used to engineer cell lines which express 
the NHP product. Such engineered cell lines may be particularly 
useful in screening and evaluation of compounds that affect the 
endogenous activity of the NHP product. 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase (Wigler 
et al., 1977, Cell 22:223), hypoxanthine-guanine 

phosphoribosyltransf erase (Szybalska and Szybalski, 19 62, Proc. 
Natl. Acad. Sci . USA 48:2026), and adenine 

phosphoribosyltransf erase (Lowy et al . , 1980, Cell 22:817) 
genes, which can be employed in tk", hgprt" or aprt" cells, 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler et al . , 1980, Proc. 
Natl. Acad. Sci. USA 77:3567; O ' Hare et al . , 1981, Proc. Natl. 
Acad. Sci. USA 78:1527); gpt, which confers resistance to 
mycophenolic acid (Mulligan and Berg, 1981, Proc. Natl. Acad. 
Sci. USA 78:2072); neo, which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin et al . , 1981, J. Mol . 
Biol. 250:1); and hygro, which confers resistance to hygromycin 
(Santerre et al . , 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified 
by utilizing an antibody specific for the fusion protein being 
expressed. For example, a system described by Janknecht et al . 
allows for the , ready purification of non-denatured fusion 
proteins expressed in human cell lines (Janknecht, et al . , 1991, 
Proc. Natl. Acad. Sci. USA 58:8972-8976). In this system, the 
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sequence of interest is subcloned into a vaccinia recombination 
plasmid such that the sequence's open reading frame is 
translationally fused to an amino- terminal tag consisting of six 
histidine residues. ' Extracts from cells infected with 
5 recombinant vaccinia virus are loaded onto Ni 2+ • nitriloacetic 
acid-agarose columns and histidine- tagged proteins are 
selectively eluted with imidazole-containing buffers. 

Also encompassed by the present invention are fusion 
proteins that direct the NHP to a target organ and/or 
10 facilitate transport across the membrane into the cytosol . 
tmJ Conjugation of NHPs to antibody molecules or their Fab fragments 

Q. could be used to target cells bearing a particular epitope. 

O 

ry Attaching the appropriate signal sequence to the NHP would also 

i n 

if; transport the NHP to the desired location within the cell. 

IfUlS Alternatively targeting of NHP or its nucleic acid sequence 

LTJ 

might be achieved using liposome or lipid complex based 



delivery systems . Such technologies are described in 
"Liposomes : A Practical Approach", New, R.R.C., ed. , Oxford 
University Press, New York and in U.S. Patent Nos. 4,594,595, 



M=20 5,459,127, 5,948,767 and 6,110,490 and their respective 



disclosures which are herein incorporated by reference in their 
entirety. Additionally embodied are novel protein constructs 
engineered in such a way that they facilitate transport of the 
NHP to the target site or desired organ, where they cross the 

2 5 cell membrane and/or the nucleus where the NHP can exert its 

functional activity. This goal may be achieved by coupling of 
the NHP to a cytokine or other ligand that provides targeting 
specificity, and/or to a protein transducing domain (see 
generally U.S. applications Ser. No. 60/111,701 and 60/056,713, 

3 0 both of which are herein incorporated by reference, for examples 

of such transducing sequences) to facilitate passage across 
cellular membranes and can optionally be engineered to include 
nuclear localization . 
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5 . 3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more epitopes 
of a NHP, or epitopes of conserved variants of a NHP, or peptide 
fragments of a NHP are also encompassed by the invention. Such 
5 antibodies include but are not limited to polyclonal antibodies, 
monoclonal antibodies (mAbs) , humanized or chimeric antibodies, 
single chain antibodies, Fab fragments, F(ab*) 2 fragments, 
fragments produced by a Fab expression library, anti-idiotypic 
(anti-Id) antibodies, and epitope-binding fragments of any of 
10 the above. 

jjf The antibodies of the invention may be used, for example, 

O in the detection of NHP in a biological sample and may, 

e~ therefore, be utilized as part of a diagnostic or prognostic 

•sir 5 

TU technique whereby patients may be tested for abnormal amounts of 
yi 15 NHP. Such antibodies may also be utilized in conjunction with, 
* a for example, compound screening schemes for the evaluation of 

HI the effect of test compounds on expression and/or activity of a 

[ft NHP expression product. Additionally, such antibodies can be 
P used in conjunction gene therapy to, for example, evaluate the 
20 normal and/or engineered NHP- expressing cells prior to their 

introduction into the patient. Such antibodies may additionally 
be used as a method for the inhibition of abnormal NHP activity. 
Thus, such antibodies may, therefore, be utilized as part of 
treatment methods . 
25 For the production of antibodies, various host animals may 

be immunized by injection with a NHP, an NHP peptide {e.g., one 
corresponding to a functional domain of an NHP) , truncated NHP 
polypeptides (NHP in which one or more domains have been 
deleted) , functional equivalents of the NHP or mutated variant 
30 of the NHP. Such host animals may include but are not limited 
to pigs, rabbits, mice, goats, and rats, to name but a few. 
Various adjuvants may be used to increase the immunological 
response, depending on the host species, including, but not 
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limited to, Freund's adjuvant (complete and incomplete), mineral 
salts such as aluminum hydroxide or aluminum phosphate, 
chitosan, surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil emulsions, and 
potentially useful human adjuvants such as BCG (bacille 
Calmette-Guerin) and Corynebacterium parvum. Alternatively, the 
immune response could be enhanced by combination and or coupling 
with molecules such as keyhole limpet hemocyanin, tetanus 
toxoid, diphtheria toxoid, ovalbumin, cholera toxin or fragments 
thereof. Polyclonal antibodies are heterogeneous populations of 
antibody molecules derived from the sera of the immunized 
animals . 

Monoclonal antibodies, which are homogeneous populations of 
antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique of Kohler and 
Milstein, (1975, Nature 255:495-497; and U.S. Patent No. 
4,376,110), the human B-cell hybridoma technique (Kosbor et al . , 
1983, Immunology Today 4:72; Cole et al . , 1983, Proc . Natl. 
Acad. Sci . USA 80:2026-2030), and the EBV-hybridoma technique 
(Cole et al., 1985, Monoclonal Antibodies And Cancer Therapy, 
Alan R. Liss, Inc., pp. 77-96). Such antibodies may be of any 
immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any 
subclass thereof. The hybridoma producing the mAb of this 
invention may be cultivated in vitro or in vivo. Production of 
high titers of mAbs in vivo makes this the presently preferred 
method of production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al . , 1984, Proc. Natl. Acad. 
Sci. USA, 51:6851-6855; Neuberger et al . , 1984, Nature, 312:604- 
608; Takeda et al . , 1985, Nature, 314:452-454) by splicing the 
genes from a mouse antibody molecule of appropriate antigen 
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specificity together with genes from a human antibody molecule 
of appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are derived 
from different animal species, such as those having a variable 
5 region derived from a murine mAb and a human immunoglobulin 
Such technologies are described in U.S. Patent Nos . 6,075,181 
and 5,877,397 and their respective disclosures which are herein 
incorporated by reference in their entirety. Also encompassed 
by the present invention is the use of fully humanized 
10 monoclonal antibodies as described in US Patent No. 6,150,584 
and respective disclosures which are herein incorporated by 
reference in their entirety. 

Alternatively, techniques described for the production of 

single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 

fU 

fyl5 Science 242:423-426; Huston et al . , 1988, Proc . Natl. Acad. Sci . 
m USA 85:5879-5883; and Ward et al . , 1989, Nature 341:544-546) can 
be adapted to produce single chain antibodies against NHP 
expression products. Single chain antibodies are formed by 
linking the heavy and light chain fragments of the Fv region via 
2 0 an amino acid bridge, resulting in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes may be 
generated by known techniques. For example, such fragments 
include, but are not limited to: the F(ab') 2 fragments which can 
be produced by pepsin digestion of the antibody molecule and the 
2 5 Fab fragments which can be generated by reducing the disulfide 
bridges of the F(ab') 2 fragments. Alternatively, Fab expression 
libraries may be constructed (Huse et al . , 1989, Science, 
245:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 
30 Antibodies to a NHP can, in turn, be utilized to generate 

anti-idiotype antibodies that "mimic" a given NHP, using 
techniques well-known to those skilled in the art. (See, e.g., 
Greenspan & Bona, 1993, FASEB J 7 (5) : 437-444 ; and Nissinoff, 



ru 
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1991, J. Immunol. 147 (8) : 2429-2438) . For example antibodies 
which bind to a NHP domain and competitively inhibit the binding 
of NHP to its cognate receptor can be used to generate anti- 
idiotypes that "mimic" the NHP and, therefore, bind and activate 
5 or neutralize a receptor. Such anti-idiotypic antibodies or Fab 
fragments of such anti-idiotypes can be used in therapeutic 
regimens involving a NHP-mediated pathway. 

Additionally given the high degree of relatedness of 
mammalian NHPs, the presently described knock-out mice (having 
10 never seen NHP, and thus never been tolerized to NHP) have a 
unique utility, as they can be advantageously applied to the 

O generation of antibodies against the disclosed mammalian NHP 

G 

pj (i.e., NHP will be immunogenic in NHP knock-out animals). 

p] The present invention is not to be limited in scope by the 

W15 specific embodiments described herein, which are intended as 

in 

single illustrations of individual aspects of the invention, and 
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functionally equivalent methods and components are within the 
scope of the invention. Indeed, various modifications of the 
invention, in addition to those shown and described herein will 
1^2 0 become apparent to those skilled in the art from the foregoing 

description. Such modifications are intended to fall within the 
scope of the appended claims. All cited publications, patents, 
and patent applications are herein incorporated by reference in 
their entirety . 
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